Amendments to the Claims: 


This listing of claims will replace all prior versions, and listings, of claims in the
application. Applicant has submitted a new complete claim set showing marked up 
claims with insertions indicated by underlining and deletions indicated by strikeouts 
and/or double bracketing. 

Listing of Claims: 

1. (Currently amended) A computer [[implemented]] method for [[minimizing]]
effects of [[outlier]] data on data [[modeling]] comprising:

selecting a modeling parameter from a plurality of modeling parameters 
characterizing a mixture of Student distribution components; 

computing an [[tractable]] approximation of a posterior distribution for the selected
modeling parameter based on an input set of data and a current estimate of a posterior 
distribution of at least one unselected modeling parameter in the plurality of modeling 
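parameters, computing the approximation being performed by a processor calculating

$q(s) = \prod_{n,m}^{N,M} p_{nm}^{s_{nm}}$, $q(\pi) = \mathcal{D}(\pi \mid \alpha)$, $q(\mu_m) = \mathcal{N}(\mu_m \mid m_m, R_m)$, $q(\Lambda_m) = \mathcal{W}(\Lambda_m \mid W_m, \eta_m)$,
or $q(u_{nm}) = \mathcal{G}(u_{nm} \mid a_{nm}, b_{nm})$;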

computing a lower bound of a log marginal likelihood as a function of current 
estimates of the posterior distributions of the modeling parameters, the current 
estimates of the posterior distributions of the modeling parameters including the 
computed [[tractable]] approximation of the posterior distribution of the selected modeling
parameter;

determining if the lower bound has been satisfactorily optimized, wherein the
lower bound is satisfactorily optimized when the computed lower bound has changed 
less than a threshold amount from a previously computed lower bound; 

generating a probability density modeling the input set of data, the probability 
density including the mixture of Student distribution components, the mixture of 
Student distribution components being characterized by the current estimates of the 
posterior distributions of the modeling parameters, when the lower bound is 
satisfactorily optimized; [[and]]

outputting the probability density, and

outputting a number of speakers from the probability density. 
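
The iterate-and-check structure recited in claim 1 (update one selected parameter while the others are held at their current estimates, recompute the bound, stop when the bound changes by less than a threshold) can be illustrated with a minimal sketch. The function names, the threshold value, and the toy objective below are assumptions for illustration only; the toy quadratic merely stands in for the lower bound of the log marginal likelihood and is not the variational bound of the claims.

```python
def coordinate_ascent(params, updates, bound, threshold=1e-10, max_iters=1000):
    """Cycle through the parameters, updating one at a time with the others
    held fixed, until the bound changes by less than `threshold`."""
    names = list(updates)
    history = [bound(params)]
    for i in range(max_iters):
        selected = names[i % len(names)]              # select a parameter
        params[selected] = updates[selected](params)  # update it, others fixed
        history.append(bound(params))                 # recompute the bound
        if abs(history[-1] - history[-2]) < threshold:
            break                                     # satisfactorily optimized
    return params, history


# Hypothetical stand-in for the lower bound: a concave quadratic in x and y.
def toy_bound(p):
    x, y = p["x"], p["y"]
    return -(x * x + y * y + x * y - 3.0 * x)


# Closed-form maximizer of toy_bound in each coordinate, the other held fixed.
toy_updates = {
    "x": lambda p: (3.0 - p["y"]) / 2.0,
    "y": lambda p: -p["x"] / 2.0,
}

fitted, history = coordinate_ascent({"x": 0.0, "y": 0.0}, toy_updates, toy_bound)
```

Because each update maximizes the objective in one coordinate, the recorded bound values never decrease, which is what makes a change-below-threshold test a usable stopping rule.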

2. (Currently amended) The method of claim 1 wherein the computing 
operations comprise a first iteration and further comprising: 

selecting a different modeling parameter from the plurality of modeling 
parameters and repeating in a subsequent iteration the operations of computing an 
[[tractable]] approximation and computing a lower bound using the newly selected
modeling parameter, [[if]] when the lower bound is not satisfactorily optimized in the first
iteration. 

3. (Original) The method of claim 1 wherein computing a lower bound 
comprises: 

computing the lower bound of the log marginal likelihood as a function of prior 
distributions of the modeling parameters. 

4. (Currently amended) The method of claim 1 wherein computing an
[[tractable]] approximation of a posterior distribution comprises:

computing a variational approximation of the posterior distribution of the 
selected modeling parameter. 

5. (Original) The method of claim 1 wherein one of the plurality of modeling 
parameters represents a mean of each of the Student distribution components. 

6. (Original) The method of claim 1 wherein one of the plurality of modeling 
parameters represents a precision matrix of the Student distribution components. 

7. (Cancelled). 

8. (Original) The method of claim 1 wherein one of the plurality of modeling 
parameters represents a scaling parameter of a precision matrix of the Student 
distribution components. 
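
For orientation, the role of a scaling parameter on a precision matrix can be seen from the standard scale-mixture representation of the Student distribution, given here as a general identity rather than as language from the claims. The symbol $\nu$ (degrees of freedom) and the un-indexed $u$, $\mu$, $\Lambda$ are notational assumptions:

```latex
\mathrm{St}(x \mid \mu, \Lambda, \nu)
  = \int_{0}^{\infty}
      \mathcal{N}\!\left(x \mid \mu, (u\Lambda)^{-1}\right)\,
      \mathcal{G}\!\left(u \mid \tfrac{\nu}{2}, \tfrac{\nu}{2}\right)\, du
```

A Gaussian whose precision $\Lambda$ is multiplied by a Gamma-distributed scale $u$ yields a Student distribution once $u$ is integrated out.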

9. (Original) The method of claim 1 wherein one of the plurality of modeling 
parameters represents a mixing coefficients parameter of the Student distribution 
components. 

10. (Original) The method of claim 1 wherein generating a probability density
comprises: 

generating the probability density including the mixture of Student distribution 
components, the mixture of Student distribution components being characterized by the
current estimates of the posterior distributions of the modeling parameters and an
estimate of the number of degrees of freedom of each Student distribution component.

Type of Response: AMENDMENT under 37 C.F.R. 1.111
Application Number: 10/724,586
Attorney Docket Number: 305414.01
Filing Date: November 28, 2003
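
A probability density of the kind recited in claim 10 can be sketched for the one-dimensional case as follows. This is an illustrative sketch only: the function names and the numeric values of the mixing coefficients, means, precisions, and degrees of freedom are hypothetical, and point values are used where the claims refer to estimates of posterior distributions.

```python
import math


def student_t_pdf(x, mean, precision, dof):
    """One-dimensional Student density with the given mean, precision,
    and degrees of freedom, evaluated in log space for stability."""
    log_norm = (math.lgamma((dof + 1.0) / 2.0) - math.lgamma(dof / 2.0)
                + 0.5 * math.log(precision / (math.pi * dof)))
    log_kernel = -0.5 * (dof + 1.0) * math.log1p(precision * (x - mean) ** 2 / dof)
    return math.exp(log_norm + log_kernel)


def mixture_pdf(x, weights, means, precisions, dofs):
    """Mixture of Student components weighted by the mixing coefficients."""
    return sum(w * student_t_pdf(x, m, lam, nu)
               for w, m, lam, nu in zip(weights, means, precisions, dofs))


# Hypothetical two-component mixture (values chosen only for illustration).
weights = [0.6, 0.4]
means = [-2.0, 3.0]
precisions = [1.0, 4.0]
dofs = [3.0, 5.0]

density_at_zero = mixture_pdf(0.0, weights, means, precisions, dofs)
```

Smaller degrees of freedom give heavier tails, so an isolated outlier lowers the likelihood less than it would under a Gaussian component.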

11. (Original) The method of claim 1 further comprising:
storing the current estimates of the posterior distributions of the modeling 
parameters in a storage location. 

12. (Previously presented) The method of claim 1 wherein the input set of
data represents auditory speech data from an unknown number of speakers. 

13. (Canceled). 

14. (Currently amended) A computer program product encoding a computer
program for executing on a computer system a computer process for minimizing effects 
of outlier data on data modeling, the computer process comprising: 

selecting a modeling parameter from a plurality of modeling parameters 
characterizing a mixture of Student distribution components; 

computing an [[tractable]] approximation of a posterior distribution for the selected
modeling parameter based on an input set of data and a current estimate of a posterior 
distribution of at least one unselected modeling parameter in the plurality of modeling
parameters, the current estimate being computed using

$p(s \mid \pi) = \prod_{n,m}^{N,M} \pi_m^{s_{nm}}$, $p(\mu_m) = \mathcal{N}(\mu_m \mid m, \rho I)$, $p(\Lambda_m) = \mathcal{W}(\Lambda_m \mid W_0, \eta_0)$,
or $p(\pi) = \mathcal{D}(\pi \mid \alpha)$;

computing a lower bound of a log marginal likelihood as a function of current 
estimates of the posterior distributions of the modeling parameters, the current 
estimates of the posterior distributions of the modeling parameters including the 
computed [[tractable]] approximation of the posterior distribution of the selected modeling
parameter; 

determining if the lower bound has been satisfactorily optimized, wherein the 
lower bound is satisfactorily optimized when the computed lower bound has changed 
less than a threshold amount from a previously computed lower bound; 

generating a probability density modeling the input set of data, the probability 
density including the mixture of Student distribution components, the mixture of 
Student distribution components being characterized by the current estimates of the 
posterior distributions of the modeling parameters, when the lower bound is 
satisfactorily optimized; and 

outputting the probability density; and

outputting a number of clusters from the probability density.

15. (Currently amended) The computer program product of claim 14
wherein the computing operations comprise a first iteration and further comprising: 

selecting a different modeling parameter from the plurality of modeling 
parameters and repeating in a subsequent iteration the operations of computing an 
[[tractable]] approximation and computing a lower bound using the newly selected
modeling parameter, [[if]] when the lower bound is not satisfactorily optimized in the first
iteration. 

16. (Original) The computer program product of claim 14 wherein computing a
lower bound comprises: 

computing the lower bound of the log marginal likelihood as a function of prior 
distributions of the modeling parameters. 

17. (Currently amended) The computer program product of claim 14
wherein computing a [[tractable]] approximation of a posterior distribution comprises:

computing a variational approximation of the posterior distribution of the 
selected modeling parameter. 

18. (Original) The computer program product of claim 14 wherein one of the
plurality of modeling parameters represents a mean of each of the Student distribution 
components. 

19. (Original) The computer program product of claim 14 wherein one of the
plurality of modeling parameters represents a precision matrix of the Student 
distribution components. 

20. (Cancelled). 

21. (Original) The computer program product of claim 14 wherein one of the
plurality of modeling parameters represents a scaling parameter of a precision matrix of 
the Student distribution components. 

22. (Original) The computer program product of claim 14 wherein one of the
plurality of modeling parameters represents a mixing coefficients parameter of the 
Student distribution components. 

23. (Original) The computer program product of claim 14 wherein generating a
probability density comprises:

generating the probability density including the mixture of Student distribution
components, the mixture of Student distribution components being characterized by the 
current estimates of the posterior distributions of the modeling parameters and an 
estimate of the degrees of freedom of each Student distribution component. 

24. (Original) The computer program product of claim 14 wherein the computer
process further comprises: 

storing the current estimates of the posterior distributions of the modeling 
parameters in a storage location. 

25. (Currently Amended) The computer program product of claim 14
wherein the input set of data represents auditory speech data from an unknown number
of speakers, and wherein the number of clusters corresponds to the unknown number of
speakers.
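
The claims do not recite how the number of clusters is derived from the probability density. One plausible reading, shown here purely as an assumption, is to count the mixture components whose expected mixing coefficient is non-negligible, since surplus components in such a mixture tend to receive weights near zero. The function name, the tolerance, and the example weights are all hypothetical.

```python
def count_clusters(mixing_coefficients, tolerance=1e-3):
    """Count components whose mixing coefficient exceeds `tolerance`.
    Assumed rule for illustration; the claims do not specify one."""
    return sum(1 for weight in mixing_coefficients if weight > tolerance)


# Hypothetical expected mixing coefficients for a four-component model in
# which two components have effectively been switched off.
expected_mixing = [0.55, 0.0002, 0.4497, 0.0001]
number_of_clusters = count_clusters(expected_mixing)
```

Under the reading of claim 25, the resulting count would be reported as the estimated number of speakers.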

26. (Previously presented) The computer program product of claim 14
wherein the input set of data represents image segmentation data from images. 

27. (Currently amended) A system for [[minimizing]] effects of [[outlier]] data on
data [[modeling]] comprising:

a processor;
a memory;

a modeling parameter selector operable with the processor and memory to 
select[[ing]] a modeling parameter from a plurality of modeling parameters characterizing
a mixture of Student distribution components; 

an [[tractable]] approximation module computing an [[tractable]] approximation of a
posterior distribution for the selected modeling parameter based on an input set of data 
and a current estimate of a posterior distribution of at least one unselected modeling 
parameter in the plurality of modeling parameters; 

a lower bound optimizer module computing a lower bound of a log marginal 
likelihood as a function of current estimates of the posterior distributions of the
modeling parameters using $\mathcal{L}(q) = \int q(\theta) \ln \left\{ \frac{p(X, \theta)}{q(\theta)} \right\} d\theta \le \ln p(X)$, the current
estimates of the posterior distributions of the modeling parameters including the
computed [[tractable]] approximation of the posterior distribution of the selected modeling
parameter, and determining if the lower bound has been satisfactorily optimized, 
wherein the lower bound is satisfactorily optimized when the computed lower bound 
has changed less than a threshold amount from a previously computed lower bound; 

a data model generator generating a probability density modeling the input set 
of data, the probability density including the mixture of Student distribution 
components, the mixture of Student distribution components being characterized by the 
current estimates of the posterior distributions of the modeling parameters, when the 
lower bound is satisfactorily optimized; [[and]]

an output device outputting the probability density and outputting a number of
clusters from the probability density.
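
The inequality recited in claim 27, that the bound never exceeds the log marginal likelihood ln p(X), can be checked numerically on a small example. The sketch below is illustrative only: it replaces the integral over the parameters with a sum over three discrete values, and the prior and likelihood numbers are hypothetical.

```python
import math


def lower_bound(q, joint):
    """Discrete analogue of the bound: sum over theta of
    q(theta) * ln(p(X, theta) / q(theta)), skipping terms with q = 0."""
    return sum(qi * math.log(ji / qi) for qi, ji in zip(q, joint) if qi > 0.0)


# Hypothetical model with three parameter values.
prior = [0.5, 0.3, 0.2]             # p(theta)
likelihood = [0.10, 0.40, 0.70]     # p(X | theta) for the observed X
joint = [p * l for p, l in zip(prior, likelihood)]   # p(X, theta)

evidence = sum(joint)               # p(X)
log_evidence = math.log(evidence)   # ln p(X)

posterior = [j / evidence for j in joint]   # exact p(theta | X)
uniform = [1.0 / 3.0] * 3                   # a cruder approximation q

bound_uniform = lower_bound(uniform, joint)
bound_exact = lower_bound(posterior, joint)
```

The bound is strictly below ln p(X) for the uniform approximation and meets it when q equals the exact posterior, which is why increasing the bound moves q toward the true posterior.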

28. (Original) The system of claim 27 wherein the lower bound optimizer 
computes the lower bound of the log marginal likelihood as a function of prior 
distributions of the modeling parameters. 

29. (Currently amended) The system of claim 27 wherein the [[tractable]]
approximation module computes a variational approximation of the posterior 
distribution of the selected modeling parameter. 

30. (Original) The system of claim 27 wherein one of the plurality of modeling 
parameters represents a mean of each of the Student distribution components. 

31. (Original) The system of claim 27 wherein one of the plurality of modeling
parameters represents a precision matrix of the Student distribution components. 

32. (Cancelled). 

33. (Original) The system of claim 27 wherein one of the plurality of modeling 
parameters represents a scaling parameter of a precision matrix of the Student 
distribution components. 

34. (Original) The system of claim 27 wherein one of the plurality of modeling 
parameters represents a mixing coefficients parameter of the Student distribution 
components. 

35. (Original) The system of claim 27 wherein the data model generator 
generates the probability density including the mixture of Student distribution 
components, the mixture of Student distribution components being characterized by the 
current estimates of the posterior distributions of the modeling parameters and an 
estimate of the degrees of freedom of each Student distribution component. 

36. (Original) The system of claim 27 further comprising:

a memory storing the current estimates of the posterior distributions of the 
modeling parameters. 

37. (Currently Amended) The system of claim 27 wherein the input set of 
data represents auditory speech data from an unknown number of speakers, and
wherein the number of clusters corresponds to the unknown number of speakers. 

38. (Previously presented) The system of claim 27 wherein the input set of 
data represents image segmentation data from images. 

39. (Currently amended) A computer [[implemented]] method for [[minimizing]]
effects of [[outlier]] data on data [[modeling]] comprising:

computing an [[tractable]] approximation of a posterior distribution for a selected
modeling parameter of a plurality of modeling parameters characterizing a mixture of 
Student distribution components based on an input set of data and a current estimate of 
a posterior distribution of at least one unselected modeling parameter in the plurality of 
modeling parameters, wherein computing the approximation is performed by a
processor calculating

$q(s) = \prod_{n,m}^{N,M} p_{nm}^{s_{nm}}$, $q(\pi) = \mathcal{D}(\pi \mid \alpha)$, $q(\mu_m) = \mathcal{N}(\mu_m \mid m_m, R_m)$, $q(\Lambda_m) = \mathcal{W}(\Lambda_m \mid W_m, \eta_m)$,
or $q(u_{nm}) = \mathcal{G}(u_{nm} \mid a_{nm}, b_{nm})$;

determining whether current estimates of the posterior distributions of the 
modeling parameters are satisfactorily optimized in relation to a predetermined 
criterion, the current estimates of the posterior distributions of the modeling 
parameters including the computed [[tractable]] approximation of the posterior
distribution of the selected modeling parameter;

modeling the input set of data by the mixture of Student distribution
components, the mixture of Student distribution components being characterized by the
current estimates of the posterior distributions of the modeling parameters; [[and]]

outputting the modeling of the input set of data; and

outputting a number of clusters from the probability density.

40. (Currently amended) The method of claim 39 wherein the computing 
operation and determining operation comprise a first iteration and further comprising: 

selecting a different modeling parameter from the plurality of modeling 
parameters and repeating in a subsequent iteration the operations of computing a 
[[tractable]] approximation and computing a lower bound using the newly selected
modeling parameter, [[if]] when the lower bound is not satisfactorily optimized in the first
iteration. 

41. (Previously presented) The method of claim 39 wherein the operation of
determining whether current estimates of the posterior distributions of the modeling 
parameters are satisfactorily optimized comprises: 

computing a lower bound of the log marginal likelihood as a function of prior 
distributions of the modeling parameters and a variational posterior distribution; and 

determining whether the lower bound satisfies the predetermined criterion of the 
selected modeling parameter. 

42. (Currently amended) The method of claim 39 wherein computing a 
[[tractable]] approximation of a posterior distribution comprises:

computing a variational approximation of the posterior distribution. 

43. (Original) The method of claim 39 wherein one of the plurality of modeling
parameters represents a mean of each of the Student distribution components. 

44. (Original) The method of claim 39 wherein one of the plurality of modeling 
parameters represents a precision matrix of the Student distribution components. 

45. (Cancelled). 

46. (Original) The method of claim 39 wherein one of the plurality of modeling 
parameters represents a scaling parameter of a precision matrix of the Student 
distribution components. 

47. (Original) The method of claim 39 wherein one of the plurality of modeling 
parameters represents a mixing coefficients parameter of the Student distribution 
components. 

48. (Original) The method of claim 39 wherein modeling the input data 
comprises: 

generating the probability density including the mixture of Student distribution 
components, the mixture of Student distribution components being characterized by the 
current estimates of the posterior distributions of the modeling parameters and an 
estimate of the degrees of freedom of each Student distribution component. 

49. (Original) The method of claim 39 further comprising: 

storing the current estimates of the posterior distributions of the modeling 
parameters in a storage location. 

50. (Currently amended) A computer program product encoding a computer
program for executing on a computer system a computer process for minimizing effects 
of outlier data on data modeling, the computer process comprising: 

computing an [[tractable]] approximation of a posterior distribution for a selected
modeling parameter of a plurality of modeling parameters characterizing a mixture of 
Student distribution components based on an input set of data and a current estimate of 
a posterior distribution of at least one unselected modeling parameter in the plurality of 
modeling parameters, computing the approximation being performed by a processor
calculating

$q(s) = \prod_{n,m}^{N,M} p_{nm}^{s_{nm}}$, $q(\pi) = \mathcal{D}(\pi \mid \alpha)$, $q(\mu_m) = \mathcal{N}(\mu_m \mid m_m, R_m)$, $q(\Lambda_m) = \mathcal{W}(\Lambda_m \mid W_m, \eta_m)$,
$q(u_{nm}) = \mathcal{G}(u_{nm} \mid a_{nm}, b_{nm})$;

determining whether current estimates of the posterior distributions of the 
modeling parameters are satisfactorily optimized in relation to a predetermined 
criterion, the current estimates of the posterior distributions of the modeling 
parameters including the computed [[tractable]] approximation of the posterior
distribution of the selected modeling parameter; 

modeling the input set of data by the mixture of Student distribution 
components, the mixture of Student distribution components being characterized by the 
current estimates of the posterior distributions of the modeling parameters; [[and]]

outputting the modeling of the input set of data; and

outputting a number of clusters from the probability density.

51. (Currently amended) The computer program product of claim 50
wherein the computing operation and determining operation comprise a first iteration 
and further comprising: 

selecting a different modeling parameter from the plurality of modeling 
parameters and repeating in a subsequent iteration the operations of computing a 
[[tractable]] approximation and computing a lower bound using the newly selected
modeling parameter, [[if]] when the lower bound is not satisfactorily optimized in the first
iteration. 

52. (Previously presented) The computer program product of claim 50 
wherein the operation of determining whether current estimates of the posterior 
distributions of the modeling parameters are satisfactorily optimized comprises: 

computing a lower bound of the log marginal likelihood as a function of prior 
distributions of the modeling parameters and a variational posterior distribution; and 
determining whether the lower bound satisfies the predetermined criterion. 

53. (Currently amended) The computer program product of claim 50 
wherein computing an [[tractable]] approximation of a posterior distribution comprises:

computing a variational approximation of the posterior distribution of the 
selected modeling parameter. 

54. (Original) The computer program product of claim 50 wherein one of the 
plurality of modeling parameters represents a mean of each of the Student distribution 
components. 

55. (Original) The computer program product of claim 50 wherein one of the
plurality of modeling parameters represents a precision matrix of the Student 
distribution components. 

56. (Cancelled). 

57. (Original) The computer program product of claim 50 wherein one of the 
plurality of modeling parameters represents a scaling parameter of a precision matrix of 
the Student distribution components. 

58. (Original) The computer program product of claim 50 wherein one of the 
plurality of modeling parameters represents a mixing coefficients parameter of the 
Student distribution components. 

59. (Original) The computer program product of claim 50 wherein modeling the 
input data comprises: 

generating the probability density including the mixture of Student distribution 
components, the mixture of Student distribution components being characterized by the 
current estimates of the posterior distributions of the modeling parameters and an 
estimate of the degrees of freedom of each Student distribution component. 

60. (Original) The computer program product of claim 50 wherein the computer 
process further comprises: 

storing the current estimates of the posterior distributions of the modeling 
parameters in a storage location. 

61. (Currently amended) A system for [[minimizing]] effects of [[outlier]] data on
data [[modeling]] comprising:

a processor; 
a memory; 

a [[tractable]] approximation module operable with the processor and memory to
[[computing]] compute an [[tractable]] approximation of a posterior distribution for a selected
modeling parameter of a plurality of modeling parameters characterizing a mixture of 
Student distribution components based on an input set of data and a current estimate of 
a posterior distribution of at least one unselected modeling parameter in the plurality of
modeling parameters by calculating

$q(s) = \prod_{n,m}^{N,M} p_{nm}^{s_{nm}}$, $q(\pi) = \mathcal{D}(\pi \mid \alpha)$, $q(\mu_m) = \mathcal{N}(\mu_m \mid m_m, R_m)$, $q(\Lambda_m) = \mathcal{W}(\Lambda_m \mid W_m, \eta_m)$,
$q(u_{nm}) = \mathcal{G}(u_{nm} \mid a_{nm}, b_{nm})$;

an optimizer module determining whether current estimates of the posterior 
distributions of the modeling parameters are satisfactorily optimized in relation to a 
predetermined criterion, the current estimates of the posterior distributions of the 
modeling parameters including the computed [[tractable]] approximation of the posterior
distribution of the selected modeling parameter; 

a data model generator modeling the input set of data by the mixture of Student 
distribution components, the mixture of Student distribution components being 
characterized by the current estimates of the posterior distributions of the modeling 
parameters; and 

an output device outputting the modeling of the input set of data and outputting 
a number of clusters from the probability density.

62. (Previously presented) The system of claim 61 wherein optimizer module 
computes a lower bound of the log marginal likelihood as a function of prior 
distributions of the modeling parameters and a variational posterior distribution, and
determines whether the lower bound satisfies the predetermined criterion. 

63. (Currently amended) The system of claim 61 wherein the [[tractable]]
approximation modules computes a variational approximation of the posterior 
distribution of the selected modeling parameter. 

64. (Original) The system of claim 61 wherein one of the plurality of modeling 
parameters represents a mean of each of the Student distribution components. 

65. (Original) The system of claim 61 wherein one of the plurality of modeling 
parameters represents a precision matrix of the Student distribution components. 

66. (Cancelled). 

67. (Original) The system of claim 61 wherein one of the plurality of modeling 
parameters represents a scaling parameter of a precision matrix of the Student 
distribution components. 

68. (Original) The system of claim 61 wherein one of the plurality of modeling 
parameters represents a mixing coefficients parameter of the Student distribution 
components. 

69. (Original) The system of claim 61 wherein modeling the input data
comprises: 

generating the probability density including the mixture of Student distribution 
components, the mixture of Student distribution components being characterized by the 
current estimates of the posterior distributions of the modeling parameters and an 
estimate of the degrees of freedom of each Student distribution component. 

70. (Original) The system of claim 61 further comprising: 

a memory storing the current estimates of the posterior distributions of the 
modeling parameters. 

71. (Currently amended) The method of claim 1 further comprising [[wherein]]
populating the input set of data [[includes]] with only observed data.